Journal: Nucleic Acids Research
Article Title: A practical, bioinformatic workflow system for large data sets generated by next-generation sequencing
doi: 10.1093/nar/gkq667
Figure Lengend Snippet: The 20 most represented (InterPro) protein domains inferred from peptides conceptually translated from individual contigs for Oesophagostomum dentatum [combined assembly of data for adult female, adult male, and the third (L3) and fourth (L4) larval stages] and InterPro protein domains (level 1) assigned to predicted peptides unique to each stage or sex following in silico subtraction
Article Snippet: The numbers of peptides inferred from sequence clusters in the adult female, adult male and/or L4 data sets, which could be assigned InterPro and/or GO terms, are given in In total, 85 395 peptides were predicted for all sequences from all four data sets, representing 17.5% of clusters ( ); 56 940 (66.7%) of them could be mapped to known proteins defined by 31 982 different domains, the most represented being ‘SCP-like extracellular’ (IPR014044; 1.2% of the peptides mapping to a conserved protein motif), ‘NAD(P)-binding’ (IPR016040; 1.1%) and ‘proteinase inhibitor I2, Kunitz metazoa’ (IPR002223; 1%) ( ).
Techniques: In Silico, Protease Inhibitor, Binding Assay, Ligand Binding Assay